Natural Language Processing in Biomedicine: A Unified System Architecture Overview
Abstract
In contemporary electronic medical records, much of the clinically important data (signs and symptoms, symptom severity, disease status, etc.) is not provided in structured data fields but is instead encoded in clinician-generated narrative text. Natural language processing (NLP) provides a means of unlocking this important data source for applications in clinical decision support, quality assurance, and public health. This chapter provides an overview of representative NLP systems in biomedicine based on a unified architectural view. The general architecture of an NLP system consists of two main components: background knowledge, which includes biomedical knowledge resources, and a framework that integrates NLP tools to process text. Systems differ in both components, which we review briefly. Additionally, the challenges facing current research efforts in biomedical NLP include the paucity of large, publicly available annotated corpora, although initiatives that facilitate data sharing, system evaluation, and collaboration between researchers in clinical NLP are starting to emerge.
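The two-component view described in the abstract can be illustrated with a minimal sketch. It is not drawn from any system reviewed in the chapter: the lexicon, the concept identifiers, and the names `Document`, `tokenize`, `make_concept_mapper`, and `run_pipeline` are all hypothetical, and a real system would use a resource such as the UMLS in place of the toy dictionary and would also handle negation, which this sketch does not.

```python
import re
from dataclasses import dataclass, field
from typing import Callable, Dict, List

# Component 1: background knowledge.
# A toy lexicon standing in for a biomedical knowledge resource.
# The identifiers are made-up placeholders, not real terminology codes.
TOY_LEXICON: Dict[str, str] = {
    "fever": "CONCEPT_FEVER",
    "chest pain": "CONCEPT_CHEST_PAIN",
    "cough": "CONCEPT_COUGH",
}


@dataclass
class Document:
    """Shared container that each processing step reads and updates."""
    text: str
    tokens: List[str] = field(default_factory=list)
    concepts: List[str] = field(default_factory=list)


# Component 2: a framework that chains NLP tools.
Step = Callable[[Document], Document]


def tokenize(doc: Document) -> Document:
    """Lowercase the text and split it into alphabetic tokens."""
    doc.tokens = re.findall(r"[a-z]+", doc.text.lower())
    return doc


def make_concept_mapper(lexicon: Dict[str, str], max_len: int = 3) -> Step:
    """Build a step that maps token spans to lexicon entries (longest match first)."""
    def map_concepts(doc: Document) -> Document:
        found: List[str] = []
        i = 0
        while i < len(doc.tokens):
            longest = min(max_len, len(doc.tokens) - i)
            for n in range(longest, 0, -1):
                phrase = " ".join(doc.tokens[i:i + n])
                if phrase in lexicon:
                    found.append(lexicon[phrase])
                    i += n
                    break
            else:
                i += 1  # no entry starts at this token
        doc.concepts = found
        return doc
    return map_concepts


def run_pipeline(text: str, steps: List[Step]) -> Document:
    """Apply each step in order to a fresh Document."""
    doc = Document(text)
    for step in steps:
        doc = step(doc)
    return doc


if __name__ == "__main__":
    steps = [tokenize, make_concept_mapper(TOY_LEXICON)]
    result = run_pipeline("Patient reports chest pain and fever.", steps)
    print(result.concepts)
```

Keeping the lexicon separate from the processing steps mirrors the abstract's point that systems can differ in either component: swapping the dictionary changes the background knowledge, while adding or reordering steps changes the framework.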
Related resources
Semantic Interpretation for the Biomedical Research Literature
Chapter Overview Natural language processing is increasingly used to support biomedical applications that manipulate information rather than documents. Examples include automatic summarization, question answering, and literature-based scientific discovery. Semantic processing is a method of automatic language analysis that identifies concepts and relationships to represent document content. The...
NECLA at the Medical Natural Language Processing Pilot Task (MedNLP)
This paper gives an overview of NECLA’s submitted systems for the De-Identification and Complaint & Diagnosis subtasks of the Medical Natural Language Processing Pilot Task (MedNLP)[5]. Our systems combine features derived from Part of Speech (POS) tags, a domain-specific dictionary, the Unified Medical Language System (UMLS) metathesaurus and semantic network, and a small set of heuristics bas...
Edite - A Natural Language Interface to Databases: A new dimension for an old approach
This article presents the Edite system, a Natural Language Interface for Databases (NLIDB), that tries to explore the advantages of joining natural language processing with the expressiveness of graphical interfaces. In order to guarantee a permanent adaptation of this type of solution to a dynamic domain one should consider two critical fundamental factors: extensibility and portability. An ov...
Integrating Natural Language Processing and Biomedical Domain Knowledge for Increased Information Retrieval Effectiveness
Underspecified semantic structures serve as the basis for indexing terms for information retrieval. Biomedical semantic types from the National Library of Medicine’s Unified Medical Language System® constrain coordinate structures to increase the accuracy of the semantic representation. Preliminary experiments conducted on 3,000 MEDLINE titles and abstracts indicate that the approach contribute...
Determining Prominent Subdomains in Medicine
We discuss an automated method for identifying prominent subdomains in medicine. The motivation is to enhance the results of natural language processing by focusing on sublanguages associated with medical specialties concerned with prevalent disorders. At the core of our approach is a statistical system for topical categorization of medical text. A method based on epidemiological evidence is co...
Journal:
- Methods in Molecular Biology
Volume 1168
Publication date: 2014